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In the past few years, lossy compression has been widely applied in the field of wireless sensor networks (WSN), where energy 
efficiency is a crucial concern due to the constrained nature of the transmission devices. Often, the common thinking among 
researchers and implementers is that compression is always a good choice, because the major source of energy consumption in a 
sensor node comes from the transmission of the data. Lossy compression is deemed a viable solution as the imperfect reconstruction 
of the signal is often acceptable in WSN, subject to some application dependent maximum error tolerance. Nevertheless, this is 
seldom supported by quantitative evidence. In this paper, we thoroughly review a number of lossy compression methods from the 
literature, and analyze their performance in terms of compression efficiency, computational complexity and energy consumption. 
We consider two different scenarios, namely, wireless and underwater communications, and show that signal compression may or 
may not help in the reduction of the overall energy consumption, depending on factors such as the compression algorithm, the signal 
statistics and the hardware characteristics, i.e., micro- controller and transmission technology. The lesson that we have learned, is 
that signal compression may in fact provide some energy savings. However, its usage should be carefully evaluated, as in quite a 
few cases processing and transmission costs are of the same order of magnitude, whereas, in some other cases, the former may even 
dominate the latter. In this paper, we show quantitative comparisons to assess these tradeoffs in the above mentioned scenarios (i.e., 
wireless versus underwater). In addition, we consider recently proposed and lightweight algorithms such as Lightweight Temporal 
Compression (LTC) as well as more sophisticated FFT- or DCT-based schemes and show that the former are the best option in 
wireless settings, whereas the latter solutions are preferable for underwater networks. Finally, we provide formulas, obtained 
through numerical fittings, to gauge the computational complexity, the overall energy consumption and the signal representation 
accuracy of the best performing algorithms as a function of the most relevant system parameters. 

Keywords: Wireless Sensor Networks, Underwater Sensor Networks, Lossy Data Compression, Temporal Data Compression, 
Computational Complexity, Performance Evaluation 
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1. Introduction 

In recent years, wireless sensors and mobile technologies 
have experienced a tremendous upsurge. Advances in hardware 
design and micro-fabrication have made it possible to poten- 
tially embed sensing and communication devices in every ob- 
ject, from banknotes to bicycles, leading to the vision of the 
Internet of Things (IoT) [1]. It is expected that physical objects 
in the near future will create an unprecedented network of in- 
terconnected physical things able to communicate information 
about themselves and/or their surroundings and also capable of 
interacting with the physical environment where they operate. 

Wireless Sensor Network (WSN) technology has now 
reached a good level of maturity and is one of the main enablers 
for the IoT vision: notable WSN application examples include 
environmental monitoring [2[], geology ]3|] structural monitor- 
ing |01, smart grid and household energy metering |q1|60. The 
basic differences between WSN and IoT are the number of de- 
vices, that is expected to be very large for IoT, and their capa- 
bility of seamlessly communicating via the Internet Protocol, 
that will make IoT technology pervasive. 



The above mentioned applications require the collection and 
the subsequent analysis of large amounts of data, which are to 
be sent through suitable routing protocols to some data collec- 
tion point(s). One of the main problems of IoTs is thus related 
to the foreseen large number of devices: if this number will 
keep increasing as predicted in H, and all signs point toward 
this direction, the amount of data to be managed by the net- 
work will become prohibitive. Further issues are due to the 
constrained nature of IoT devices in terms of limited energy 
resources (devices are often battery operated) and to the fact 
that data transmission is their main source of energy consump- 
tion. This, together with the fact that IoT nodes are required to 
remain unattended (and operational) for long periods of time, 
poses severe constrains on their transmitting capabilities. 

Recently, several strategies have been developed to prolong 
the lifetime of battery operated IoT nodes. These comprise pro- 
cessing techniques such as data aggregation [8], distributed |@] 
or temporal 11 1 Oil compression as well as battery replenishment 
through energy harvesting lUTl . The rationale behind data com- 
pression is that we can trade some additional energy for com- 
pression for some reduction in the energy spent for transmis- 
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sion. As we shall see in the remainder of this paper, if the en- 
ergy spent for processing is sufficiently smaller than that needed 
for transmission, energy savings are in fact possible. 

In this paper we focus on the energy saving opportunities of- 
fered by data processing and, in particular, on the effectiveness 
of the lossy temporal compression of data. With lossy tech- 
niques, the original data is compressed by however discarding 
some of the original information in it, so that at the receiver 
side the decompressor can reconstruct the original data up to 
a certain accuracy. Lossy compression makes it possible to 
trade some reconstruction accuracy for some additional gains 
in terms of compression ratio with respect to lossless schemes. 
Note that 1) these gains correspond to further savings in terms 
of transmission needs, 2) depending on the application, some 
small inaccuracy in the reconstructed signal can in fact be ac- 
ceptable, 3) given this, lossy compression schemes introduce 
some additional flexibility as one can tune the compression ra- 
tio as a function of energy consumption criteria. 

We note that much of the existing literature has been devoted 
to the systematic study of lossless compression methods. [12] 
proposes a simple Lossless Entropy Compression (LEC) algo- 
rithm, comparing LEC with standard techniques such as gzip, 
bzip2, rar and classical Huffman and arithmetic encoders. A 
simple lossy compression scheme, called Lightweight Tempo- 
ral Compression (LTC) [13], was also considered. However, 
the main focus of this comparison has been on the achievable 
compression ratio, whereas considerations on energy savings 
are only given for LEC. lfl4ll examines Huffman, Run Length 
Encoding (RLE) and Delta Encoding (DE), comparing the en- 
ergy spent for compression for these schemes, fl 1 311 treats lossy 
(LTC) as well as lossless (LEC and Lempel-Ziv- Welch) com- 
pression methods, but only focusing on their compression per- 
formance. Further work is carried out in Q16[, where the energy 
savings from lossless compression algorithms are evaluated for 
different radio setups, in single- as well as multi-hop networks. 
Along the same lines, II 1 711 compares several lossless compres- 
sion schemes for a StrongArm CPU architecture, showing the 
unexpected result that data compression may actually cause an 
increase in the overall energy expenditure. A comprehensive 
survey of practical lossless compression schemes for WSN can 
be found in 111 811 . The lesson that we learn from these papers 
is that lossless compression can provide some energy savings. 
These are however smaller than one might expect because, for 
the hardware in use nowadays (CPU and radio), the energy 
spent for the execution of the compression algorithms (CPU) 
is of the same order of magnitude of that spent for transmission 
(radio). 

Some further work has been carried out for what concerns 
lossy compression schemes. LTC [ 15], PLAMLiS 01911 and the 
algorithm of ll20ll are all based on Piecewise Linear Approxi- 
mation (PLA). Adaptive Auto-Regressive Moving Average (A- 
ARMA) HI is instead based on ARMA models (these schemes 
will be extensively reviewed in the following Section Nev- 
ertheless, we remark that no systematic energy comparison has 
been carried out so far for lossy schemes. In this case, it is 
not clear whether lossy compression can be advantageous in 
terms of energy savings and what the involved tradeoffs are 
in terms of compression ratio vs representation accuracy and 



yet how these affect the overall energy expenditure. In ad- 
dition, it is unclear whether the above mentioned linear and 
autoregressive schemes can provide at all advantages as com- 
pared with more sophisticated techniques such as Fourier-based 
transforms, which have been effectively used to compress au- 
dio and video signals and for which fast and computationally 
efficient algorithms exist. In this paper, we fill these gaps by 
systematically comparing existing lossy compression methods 
among each other and against polynomial and Fourier-based 
(FFT and DCT) compression schemes. Our comparison is car- 
ried out for two wireless communication setups, i.e., for radio 
and acoustic modems (used for underwater sensor networking, 
see, e.g., B22I1 ) and fitting formulas for the relevant performance 
metrics are obtained for the best performing algorithms in both 
cases. 

Specifically, the main contributions of this paper are: 

• we thoroughly review lossy compression methods from 
the literature as well as polynomial, FFT- and DCT-based 
schemes, quantifying their performance in terms of com- 
pression efficiency, computational complexity (i.e., pro- 
cessing energy) and energy consumption for two radio 
setups, namely, wireless (IEEE 802.15.4) and underwa- 
ter (acoustic modems) radios. For FFT- and DCT-based 
methods we propose our own algorithms, which exploit 
the properties of these transformations. 

• We assess whether signal compression may actually help 
in the reduction of the overall energy consumption, de- 
pending on the compression algorithm, the chosen recon- 
struction accuracy, the signal statistics and the transmis- 
sion technology (i.e., wireless versus underwater). In fact, 
we conclude that signal compression may be helpful; how- 
ever, in quite a few cases processing and transmission costs 
are of the same order of magnitude. Also, in some other 
cases, the former may even dominate the latter. Notably, 
our results indicate that PLA methods (and in particu- 
lar among them LTC) are the best option for wireless ra- 
dios, whereas DCT-based compression is the algorithm of 
choice for acoustic modems. Thus, the choice of the com- 
pression algorithm itself is highly dependent on the energy 
consumption associated with radio transmission. 

• We provide formulas, obtained through numerical fittings 
and validated against real datasets, to gauge the computa- 
tional complexity, the overall energy consumption and the 
signal representation accuracy of the best performing algo- 
rithms in each scenario, as a function of the most relevant 
system parameters. These can be used to generalize the 
results obtained here for the selected radio setups to other 
architectures. 

The rest of the paper is organized as follows. Section |2] 
discusses modeling techniques from the literature, along with 
some lossy compression schemes that we introduce in this pa- 
per. In Section [3] we carry out a thorough performance evalu- 
ation of all considered methods, whereas our conclusions are 
drawn in Section|4] 
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2. Lossy Compression for Constrained Sensing Devices 

In the following, we review existing lossy signal compression 
methods for constrained sensor nodes, we present an improved 
ARMA-based compressor and we apply well known FFT and 
DCT techniques to achieve efficient lossy compression algo- 
rithms. We start with the description of what we refer to here 
as "adaptive modeling techniques" in Section IXTl Hence, in 
Section FZ2l we discuss techniques based on Fourier transforms. 

2.1. Compression Methods Based on Adaptive Modeling 

For the Adaptive Modeling schemes, some signal model is 
iteratively updated over pre-determined time windows, exploit- 
ing the correlation structure of the signal through linear, poly- 
nomial or autoregressive methods; thus, signal compression is 
achieved by sending the model parameters in place of the orig- 
inal data. 

2.1.1. Piecewise Linear Approximations (PLA) 

The term Piecewise Linear Approximation (PLA) refers to a 
family of linear approximation techniques. These build on the 
fact that, for most time series consisting of environmental mea- 
sures such as temperature and humidity, linear approximations 
work well enough over short time frames. The idea is to use a 
sequence of line segments to represent an input time series x(n) 
with a bounded approximation error. Further, since a line seg- 
ment can be determined by only two end points, PLA leads to 
quite efficient representations of time series in terms of memory 
and transmission requirements. 



x(n) 



n 



12 3 4 

Figure 1 : Approximation of a time series x(n) by a segment. 

For the reconstruction at the receiver side, at the generic time 
n observations are approximated through the vertical projection 
of the actual samples over the corresponding line segment (i.e., 
the white-filled dots in Fig. [TJ. The approximated signal in 
what follows is referred to as x(n). The error introduced is 
the distance from the actual samples (i.e., the black dots in 
the figure) to the segment along this vertical projection, i.e., 
\x(n) - x(n)\. Most PLA algorithms use standard least squares 
fitting to calculate the approximating line segments. Often, a 
further simplification is introduced to reduce the computation 
complexity, which consists of forcing the end points of each 
line segment to be points of the original time series x(n). This 
makes least squares fitting unnecessary as the line segments are 
fully identified by the extreme points of x(n) in the considered 
time window. Following this simple idea, several methods have 
been proposed in the literature. Below we review the most 
significant among them. 



Lightweight Temporal Compression (LTC) Q: the LTC al- 
gorithm is a low complexity PLA technique. Specifically, let 
x(n) be the points of a time series with n = 1, 2, . . . , N. The 
LTC algorithm starts with n — 1 and fixes the first point of 
the approximating line segment to x( \). The second point x(2) 
is transformed into a vertical line segment that determines the 
set of all "acceptable" lines Q12 with starting point x(l). This 
vertical segment is centered at x{2) and covers all values meet- 
ing a maximum tolerance s > 0, i.e., lying within the interval 
[x(2) - s, x(2) + s], see Fig. |2(a)| The set of acceptable lines for 
n = 3, £2i,2,3> is obtained by the intersection of Q1.2 and the set 
of lines with starting point x(l) that are acceptable for x(3), see 
Fig. |2(b)| If x(3) falls within Qi.2,3 the algorithm continues with 
the next point x(4) and the new set of acceptable lines £2; 2,3,4 
is obtained as the intersection of Q 1,2,3 and the set of lines with 
starting point x(l) that are acceptable for x(4). The procedures 
is iterated adding one point at a time until, at a given step s, x(s) 
is not contained in Qi,2,..., s . Thus, the algorithm sets x(l) and 
x(s - 1) as the starting and ending points of the approximating 
line segment for n - 1,2,. ..,5-1 and starts over with x(s - 1) 
considering it as the first point of the next approximating line 
segment. In our example, 5 = 4, see Fig. |2(c)| 
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Figure 2: Lightweight Temporal Compression example. 

When the inclusion of a new sample does not comply with 
the allowed maximum tolerance, the algorithm starts over 
looking for a new line segment. Thus, it self-adapts to the 
characteristics of x(n) without having to fix beforehand the 
lapse of time between subsequent updates. 

PLAMLiS OHl: as LTC, PLAMLiS represents the input data 
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series x(n) through a sequence of line segments. Here, the linear 
fitting problem is converted into a set-covering problem, trying 
to find the minimum number of segments that cover the entire 
set of values over a given time window. This problem is then 
solved through a greedy algorithm that works as follows: let 
x(n) be the input time series over a window n - 1,2,..., N, 
with A' = \x(l), x(2), . . . , x(N)}. For each x(i) e A" segments 
are built associating x(i) with x(j) (J > ;), which is the farthest 
away from x(i) such that the line segment (x(i), x(j)) meets the 
given error bound e. That is, the difference between the com- 
pressed signal x(k) and x(lc) is no larger than s for i < k < j . 
Let T; denote the subset consisting of all the points covered by 
this line segment, formally: 

Ti — {x{k) e X,i < k < j, s.t., j - i is maximized, 
given \x(k) - x(k)\ < e, V i < k < j) . 

After having iterated this for all points in A we obtain set 
f = {Ti,Ti,---, Tn}- Now, the PLAMLiS problem amounts 
to picking the least number of subsets from T that cover all 
the elements in A, which is the minimum set cover problem 
and is known to be NP-complete. The authors of lfl9ll suggest 
an approximate solution to it through a greedy algorithm. Set 
T is scanned by picking the subset T\ that covers the largest 
number of still uncovered points in A, this set is then removed 
from T and added to the empty set S, i.e., T <— T \T„ 
S <— SUfi and the algorithm is reiterated with the new set T 
until all points in A are covered. The subsets in S define the 
approximating segments for x(n). 

Enhanced PLAMLiS several refinements to PLAMLiS 
have been proposed in the literature to reduce its computational 
cost. In 112011 a top-down recursive segmentation algorithm is 
proposed. As above, consider the input time series x{n) and a 
time window n = 1,2, ...,N. The algorithm starts by taking 
a first segment (x(l), x(N)), if the maximum allowed tolerance 
s is met for all points along this segment the algorithm ends. 
Otherwise, the segment is split in two segments at the point 
x(i), 1 < i < N, where the error is maximum, obtaining the two 
segments (x(l), x(i)) and (x(i), x(N)). The same procedure is re- 
cursively applied on the resulting segments until the maximum 
error tolerance is met for all points. 

2.1.2. Polynomial Regression (PR) 

The above methods can be modified by relaxing the con- 
straint that the endpoints of the segments x(l) and x{N) must 
be actual points from x{n). In this case, polynomials of given 
order p > 1 are used as the approximating functions, whose co- 
efficients are found through standard regression methods based 
on least squares fitting [23]. Specifically, we start with a win- 
dow of p samples, for which we obtain the best fitting polyno- 
mial coefficients. Thus, we keep increasing the window length 
of one sample at a time, computing the new coefficients, while 
the target error tolerance is met. 

However, tracing a line between two fixed points as done by 
LTC and PLAMLiS has a very low computational complexity, 
while least squares fitting can have a significant cost. Polyno- 
mial regression obtains better results in terms of approxima- 
tion at the cost of higher computational complexities, which 



increase with the polynomial order, with respect to the linear 
models of Section l2.1.1l 

2.1.3. Auto-Regressive (AR) Methods 

Auto Regressive (AR) models in their multiple flavors (AR, 
ARMA, ARIMA, etc.) have been widely used for time series 
modeling and forecasting in fields like macro-economics or 
market analysis. The basic idea is to build up a model based 
on the history of the sampled data, i.e., on its correlation 
structure. Many environmental and physical quantities can be 
modeled through AR, and hence these models are specially 
indicated for WSN monitoring applications. When used for 
signal compression AR obtains a model from the input data 
and sends it to the receiver in place of the actual time series. 
The reconstructed model is thus used at the data collection 
point (the sink) for data prediction until it receives model 
updates from the sensor nodes. Specifically, each node locally 
verifies the accuracy of the predicted data values with respect 
to the collected samples. If the accuracy is within a prescribed 
error tolerance, the node assumes that the sink can rebuild the 
data correctly and it does not transmit any data. Otherwise, it 
computes a new model and communicates the corresponding 
parameters to the sink. 

Adaptive Auto-Regressive Moving Average (A- 
ARMA) [21]: the basic idea of A- ARMA JH is that of 
having each sensor node compute an ARMA model based 
on fixed-size windows of N' < N consecutive samples. 
Compression is achieved through the transmission of the 
model parameters to the sink in place of the original data, 
as discussed above. In order to reduce the complexity in the 
model estimation process, adaptive ARMA employs low-order 
models, whereby the validity of the model being used is 
checked through a moving window technique. 

The algorithm works as follows. First, once a WSN node 
has collected N' samples starting from sample n, builds an 
ARMA Model M (n) = ARMA(p,q,N',ri), where the order of 
p and q of the ARMA process, and the window length N' 
must be fixed a priori. For this model, the current estima- 
tion window goes from step n to step n + N' — 1 (covering 
samples {x(n), . . . , x(n + N' - 1)}). Upon collecting the 
subsequent K samples, M (n) is used to obtain the predicted 
values {x(n + N'), x(n + N' + K — 1)}. Thus, the RMS error 
between predicted and actual values is computed. If this error 
is within the allowed error tolerance, the sensor node keeps 
using its current ARMA model for the next K values, i.e., its 
prediction window is moved K steps to the right, covering steps 
n + N' + K - 1 to n + N' + 2K - 1 . In this case, the decoder at the 
WSN sink uses M (n) to reconstruct the signal from step n + N' 
to n + N' + K - 1 obtaining {x(n + N'), . . . , x(n + N' + K - 1)}. 
However, if the target tolerance is not met, the node moves its 
window and recomputes the new ARMA model parameters 
using the most recent N' samples. 

Modified Adaptive Auto-Regressive (MA-AR): the A- 

ARMA algorithm was designed with the main objective of re- 
ducing the complexity in the model estimation process. For 
each model update, only one estimation is performed at the be- 
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ginning of the stage, and always over a fixed window of N' 
samples. A drawback of this approach is that, especially for 
highly noisy environments, the estimation over a fixed window 
can lead to poor results when used for forecasting. 

In order to avoid this, we hereby propose a modified version 
of A-ARMA which uses a p-order AR model, dividing the time 
in terms of prediction cycles, whose length N' is variable and 
adapts to the characteristics of the signal. Let n be the the time 
index at the beginning of a prediction cycle. The first p col- 
lected samples {x{n), . . . , x(n + p — 1)} must be encoded and 
transmitted; these will be used at the receiver to initialize the 
predictor. Upon collecting sample x(n + p), the p parameters 
of a AR model M (n l) = AR(n,p, 1) are computed, where n is 
the starting point of the estimation window and N' = p + 1 is 
its window size, i.e., the support points for the estimation are 
{x(n), . . . , x(n + p)}). M (K,I) is thus used to predict x(n + p), con- 
sidering {x(n), . . . ,x(n + p - 1)} as initial values. If the target 
tolerance is met, that is, if \x(n +p) — x(n + p)\ < s, the model is 
temporally stored as valid. When the next sample x(n + p + 1) 
becomes available, a new model M ( "' 2) = AR(n, /?, 2) is obtained 
over the new estimation window [x(ri), . . . , x(n + p + l)j of size 
N' = p + 2. Then, M ( "' 2) is used to predict x(n + p) (one-step 
ahead) and x(n +p + 1) (two-step ahead), with the model M ( " ,2) 
initialized with the values [x(ri), . . . , x(n + p — 1)}, and predicted 
values are compared with the real samples to check whether 
\x(n + p + i) - x(n + p + z)| < s for i = 0,1. This process 
is iterated until, for some value k > 1, M ( "'^ is no longer ca- 
pable of meeting the target reconstruction accuracy for at least 
one sample: that is, when \x(n + p + i) - x(n + p + i)\ > e 
for at least one i e {0, . . ., k — 1}. In this case, the last valid 
model M(" ,k ~ l \ with k — 1 > 1, is encoded and transmitted to 
the decoder at the receiver side, where M <fl ' k ~ r> is initialized 
with {x(n), . . . ,x(n + p - 1)} and used to obtain the estimates 
{x(n + p), . . . , x(n + p + k - 2)}. The length of the final estima- 
tion window \&N' = p + k - 1 . At this point, a new prediction 
cycle starts with sample x(n + p + k - 1) and the new model 

The key point in our algorithm is the incremental estimation 
of the AR parameters: only the contribution of the last sample is 
considered at each iteration in order to refine the AR model. In 
this way, the computational cost for recomputing a new model 
for each sample is minimized. The AR parameters can be ob- 
tained through least squares minimization. The one-step ahead 
predictor for an AR process is defined as: 

x(n) = £,\x(n — 1) + ■ • ■ + £, p x(n — p) . 

To simplify the notation, in the following without loosing gen- 
erality, we assume n — 0. The least squares method minimizes 
the total error & defined as the sum of the individual errors of 
the one-step ahead predictor for each of the N' samples in the 
estimation window: 

N' N' 
i=p i=p 



Minimizing for each ^ yields a set of equations: 

88 N ' 

— = -2 Y WO - - 1) + • • • + { P x(i - p))]x(i -k) = 0, 

with k = 1,2,...,/?, which can be expressed in terms of the 
following linear system of equations: 
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where f(r, s) = Yif= p x _ r ) JC ( ! _ s )- This is a linear system of p 
equations and p unknown values (the coefficients) that can be 
solved through a standard Gaussian elimination method. Each 
entry of the matrix involves N' - p multiplications and N' - p 
additions. Thus, the estimation of the AR model has complexity 
0(p 2 (N' -p)) associated with the matrix construction and 0(p 3 ) 
for solving the linear system ([1}. Note that, for signals with 
high temporal correlation N' - pis usually much larger than the 
AR order p (p < 5 to bound the computational complexity). In 
this case, the dominating term is 0{p 2 (N' - p)) and increases 
with increasing A^' and thus, with increasing correlation length, 
as we will show shortly in Section[3] 

2.2. Compression Methods Based on Fourier Transforms 

For Fourier-based techniques, compression is achieved 
through sending subsets of the FFT or DCT transformation co- 
efficients. Below, we propose some possible methods that differ 
in how the transformation coefficients are picked. 

2.2.1. Fast Fourier Transform (FFT) 

The first method that we consider relies on the simplest way 
to use the Fourier transform for compression. Specifically, the 
input time series x(n) is mapped to its frequency representation 
X(f) € C through a Fast Fourier Transform (FFT). We define 
X K {f) = 9U{X(J)}, and X T (f) = 3m{X(f)} as the real and 
the imaginary part of X(f), respectively. Since x(n) is a real- 
valued time series, X(f) is Hermitian, i.e., X(-f) = X(f). This 
symmetry allows the FFT to be stored using the same number 
of samples of the original signal. For even we take / € 
{fx, ... , /n/i) for both Xr{-) and Xj(-), while if Af is odd we take 
/ e {/i , . . . , / L iv/2j + i ) for the real part and / e {/],... , / L2V/2 j } for 
the imaginary part. 

The compressed representation X(f) = Xjt(f) + jXj{ f) will 
also be in the frequency domain and it is built (for the case of 
Af even) as follows: 

1 . initialize X K (f) = and X T (f) = 0, V / e {/,,.. . , f m ] ; 

2. select the coefficient with maximum absolute value from 
Xft and Xj, i.e., /* = argmaxy max{|X^(/)|, and 
M 4 argmax I£{ «, J| {|X i (r)|}. 

3. set X M (n = X M (fl and then set X M {f) = 0. 

4. if x(n), the inverse FFT of X(f), meets the error tolerance 
constraint continue, otherwise repeat from step 2.; 

5. encode the values and the positions of the harmonics 
stored in Xr and Xj. 

Hence, the decompressor at the receiver obtains X^if) and 
Xj(f) and exploits the Hermitian symmetry to reconstruct X(f). 



2.2.2. Low Pass Filter (FFT-LPF) 

We implemented a second FFT-based lossy algorithm, which 
we have termed FFT-LPF. Since the input time series x(n) are 
in may common cases slow varying signals (i.e., having large 
temporal correlation) with some high frequency noise superim- 
posed, most of the significant coefficients of X(f) reside in the 
low frequencies. For FFT-PLF, we start setting X<n(f) = for 
all frequencies. Thus, Xft(f) is evaluated from f , incrementally 

moving toward higher frequencies, fi, fi, At each iteration 

U Xnifd is copied onto X^f) (both real and imaginary part), 
the inverse FFT is computed taking Xr(/) as input and the er- 
ror tolerance constraint is checked on the so obtained x(n). If 
the given tolerance is met the algorithm stops, otherwise it is 
reiterated for the next frequency f + \ . 

2.2.3. Windowing 

The two algorithms discussed above suffer from an edge dis- 
continuity problem. In particular, when we take the FFT over a 
window of N samples, if x(l) and x(N) differ substantially the 
information about this discontinuity is spread across the whole 
spectrum in the frequency domain. Hence, in order to meet the 
tolerance constraint for all the samples in the window, a high 
number of harmonics is selected by the previous algorithms, 
resulting in a poor compression and in a high number of opera- 
tions. 

To solve this issue, we implemented a version of the FFT 
algorithm that considers overlapping windows of N + 2W sam- 
ples instead of disjoint windows of length N, where W is the 
number of samples that overlap between subsequent windows. 
The first FFT is taken over the entire window and the selection 
of the coefficients goes on depending on the selected algorithm 
(either FFT or FFT-LPF), but the tolerance constraint is only 
checked on the N samples in the central part of the window. 
With this workaround we can get rid of the edge discontinu- 
ity problem and encode the information about the N samples 
of interest with very few coefficients as it will be seen shortly 
in Section [3] As a drawback, the direct and inverse transforms 
have to be taken on longer windows, which results in a higher 
number of operations. 

2.2.4. Discrete Cosine Transform (DCT) 

We also considered the Discrete Cosine Transform (type II), 
mainly for three reasons: 1) its coefficients are real, so we did 
not have to cope with real and imaginary parts, thus saving 
memory and number of operations; 2) it has a strong "energy 
compaction" property 112411 . i.e., most of the signal information 
tends to be concentrated in a few low-frequency components; 
3) the DCT of a signal with N samples is equivalent to a DFT 
on a real signal of even symmetry with double length, so DCT 
does not suffer from the edge discontinuity problem. 

3. Performance Comparison 

The objects of our discussion in this section are: 

• to provide a thorough performance comparison of the 
compression methods of Section [2] The selected perfor- 
mance metrics are: 1) compression ratio, 2) computational 



and transmission energy and 3) reconstruction error at the 
receiver, which are defined below; 

• to assess how the statistical properties of the selected sig- 
nals impact the performance of the compression methods; 

• to investigate whether or not data compression leads to en- 
ergy savings in single- and multi-hop scenarios for: WSN) 
a wireless sensor network and UWN) an underwater net- 
work. 

• to obtain, through numerical fitting, close-formulas which 
model the considered performance metrics as a function of 
key parameters. 

Toward the above objectives, we present simulation re- 
sults obtained using synthetic signals with varying correlation 
length. These signals make it possible to give a fine grained de- 
scription of the performance of the selected techniques, looking 
comprehensively at the entire range of variation of their tempo- 
ral correlation statistics. Real datasets are used to validate the 
proposed empirical fitting formulas. 

3.1. Performance Metrics 

Before delving into the description of the results, in the fol- 
lowing we give some definitions. 

Definition 1. Correlation length 

Given a stationary discrete time series x(n) with n 
1,2...., Y, we define correlation length of x(n) the small- 
est value n* such that the autocorrelation function of x(n) is 
smaller than a predetermined threshold 6. The autocorrelation 
is: 

E [(x(m) - fi x ) (x(m + n) - p x )] 

where fi x and cr 2 x are the mean ad the variance of x(n), respec- 
tively. Formally, n* is defined as: 

n* = argmin {p x (ri) < 6} . 

n>0 

Definition 2. Compression ratio 

Given a finite finite time series x(n) and its compressed version 
x(n), we define compression ratio i] the quantity: 

= 

71 N b (x) ' 

where Nb(x) and Nh(x) are the number of bits used to repre- 
sent the compressed time series x(n) and the original one x(n), 
respectively. 

Definition 3. Energy consumption for compression 
For every compression method we have recorded the number 
of operations to process the original time series x(n) account- 
ing for the number of additions, multiplications, divisions and 
comparisons. Thus, depending on selected hardware architec- 
ture, we have mapped these figures into the corresponding num- 
ber of clock cycles and we have subsequently mapped the latter 
into the corresponding energy expenditure, which is the energy 
drained from the battery to accomplish the compression task. 
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Definition 4. Transmission Energy 

Is the energy consumed for transmission, obtained accounting 
for the radio chip characteristics and the protocol overhead due 
to physical (PHY) and medium access (MAC) layers. 

Definition 5. Total Energy Consumption 

Is the sum of the energy consumption for compression and 
transmission and is expressed in [Joule]. 

In the computation of the energy consumption for compres- 
sion, we only accounted for the operations performed by the 
CPU, without considering the possible additional costs of read- 
ing and writing in the flash memory of the sensor. For the 
communication cost we have only taken into consideration the 
transmission energy, neglecting the cost of turning on and off 
the radio transceiver and the energy spent at the destination to 
receive the data. The first are fixed costs that would also be in- 
curred without compression, while the latter can be ignored if 
the receiver is not a power constrained device. Moreover, for 
the MAC we do not consider retransmissions due to channel 
errors or multi-user interference. 

3.2. Hardware Architecture 

For both the WSN and the UWN scenarios we selected the 
TI MSP430 1I25I1 micro-controller using the corresponding 16 
bit floating point package for the calculations and for the data 
representation. In the active state, the MSP430 is powered by a 
current of 330 pA at 2.2 V and it has a clock rate of 1 MHz. The 
resulting energy consumption per CPU cycle is Eq = 0.726 nJ. 
In Table Q] the number of clock cycles needed for the floating 
point operations are listed. 



Operation 


Clock cycles 


Addition 


184 


Subtraction 


177 


Multiplication 


395 


Division 


405 


Comparison 


37 



Table 1 : CPU Cycles for the TI MSP430 micro-controller, see Section 5 of (Hi. 

For the WSN scenario, we selected the TI CC2420 RF 
transceiver ll26ll . an IEEE 802.15.4 ll27ll compliant radio. The 
current consumption for the transmission is 17.4 mA at 3.3 V, 
for an effective data rate of 250 kbps. Thus, the energy cost 
associated with the transmission of a bit is E' T = 230 nJ, 
which equals the energy spent by the micro-processor during 
316 clock cycles in the active state. 

For the UWN scenario, we considered the Aquatec AquaMo- 
dem [28], an acoustic modem featuring a data rate up to 2000 
bps consuming a power of 20 W. In this case, the energy spent 
for the transmission of one bit of data is E' Tx = 10 mJ. We 
remark that the same amount of energy is spent by the micro- 
processor during 13 • 10 6 clock cycles. 

3.3. Generation of Synthetic Stationary Signals 

The stationary synthetic signals have been obtained through 
a known method to enforce the first and second moments to a 



white random process, see [29][30]. Our objective is to obtain 
a random time series x(ri) with given mean p x , variance cr 2 and 
autocorrelation function p x {n). The procedure works as follow: 

1. A random Gaussian series G(k) with k = 1,2, ...,N is 
generated in the frequency domain, where N is the length 
of the time series x(n) that we want to obtain. Every ele- 
ment of G(k) is an independent Gaussian random variable 
with mean pc — and variance <x 2 — 1 . 

2. The Discrete Fourier Transform (DFT) of the autocorrela- 
tion function p x (n) is computed, S x (k) = T[p x (n)~\, where 
T\-\ is the DFT operator. 

3. We compute the entry- wise product X(k) - G(k) o S x (k)^. 

4. The correlated time series x(n) is finally obtained as 
T- l [X(k)]. 

This is equivalent to filter a white random process with a linear, 
time invariant filter, whose transfer function is T~ l [S x (k)~-~\. 
The stability of this procedure is ensured by a suitable choice 
for the correlation function, which must be square integrable. 
For the simulations in this paper we have used a Gaussian corre- 
lation function [31], i.e., p x (n) = expf-an 2 }, where a is chosen 
in order to get the desired correlation length n* as follows: 

a (n*) 2 ■ 

Without loss of generality, we generate synthetic signals with 
p x = and cr 2 = 1 . In fact, applying an offset to the generated 
signals and a scale factor does not change the resulting correla- 
tion. For an in deep characterization of the Gaussian correlation 
function see lf3lll . 

Also, in order to emulate the behavior of real WSN signals, 
we superimpose a noise to the synthetic signals so as to mimic 
random perturbations due to limited precision of the sensing 
hardware and random fluctuations of the observed physical phe- 
nomenon. This noise is modeled as a zero mean white Gaussian 
process with standard deviation cr noise . 

3.4. Simulation Setup 

For the experimental results of the following Sections 13.51 
and 13.61 we used synthetic signals with correlation length n* 
varying in jl, 10, 20, 50, . . . , 500} time slots, where after 20, n* 
varies in steps of 30 (we have picked 5 - 0.05 for all the results 
shown in this paper). We consider time series of N - 500 sam- 
ples (time slots) at a time, progressively taken from a longer 
realization of the signal, so as to avoid artifacts related to the 
generation technique. Moreover, a Gaussian noise with stan- 
dard deviation cr noise = 0.04 has been added to the signal, as 
per the signal generation method of Section 1331 For the recon- 
struction accuracy, the absolute error tolerance has been set to 
e = £cr noise , with £ > 0. In the following graphs, each point 
is obtained by averaging the outcomes of 10 4 simulation runs. 
For a fair comparison, the same realization of the input signal 
x(n) has been used for all the compression methods, for each 
simulation run and value of n* . 
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3.5. Compression Ratio vs Processing Energy 

In the following, we analyze the performance in terms 
of compression effectiveness and computational complexity 
(energy) for the lossy compression methods of Section [2] 

Adaptive Modeling Methods: in this first set of results we 
compare the performance of the following compression meth- 
ods: 1) Modified Adaptive Autoregressive (M-AAR); 2) Poly- 
nomial Regression (PR); 3) Piecewise Linear Approximation 
(PLAMLiS); 4) Enhanced Piecewise Linear Approximation (E- 
PLAMLiS) and 5) Lightweight Temporal Compression (LTC). 
For the M-AAR autoregressive filter and the polynomial regres- 
sion (PR) we used four different orders, namely, p = {2, 3, 4, 5}. 

Fig. |3(a)| shows the Compression Ratio achieved by the five 
compression methods as a function of the correlation length «*. 
These results reveal that for small values of n* the compression 
performance is poor for all compression schemes, whereas it 
improves for increasing correlation length, by reaching a floor 
value for sufficiently large n* . This confirms that n* is a key 
parameter for the performance of all schemes. Also, the com- 
pression performance differs among the different methods, with 
PR giving the best results. This reflects the fact that, differently 
from all the other methods, PR approximates x(n) without re- 
quiring its fitting curves to pass from the points of the given 
input signal. This entails some inherent filtering, that is embed- 
ded in this scheme and makes it more robust against small and 
random perturbations. 

Fig. |3(b)| shows the energy consumption for compression. 
For increasing values of n* the compression ratio becomes 
smaller for all schemes, but their energy expenditure substan- 
tially differs. Notably, the excellent compression capabilities of 
PR are counterbalanced by its demanding requirements in terms 
of energy. M-AAR and PLAMLiS also require a quite large 
amount of processing energy, although this is almost one order 
of magnitude smaller than that of PR. LTC and E-PLAMLiS 
have the smallest energy consumption among all schemes. 

We now discuss the dependence of the computational 
complexity (which is strictly related to the energy spent for 



compression) on n*. LTC encodes the input signal x(n) incre- 
mentally, starting from the first sample and adding one sample 
at a time. Thus, the number of operations that it performs 
only weakly depends on the correlation length and, in turn, the 
energy that it spends for compression is almost constant with 
varying «*. E-PLAMLiS takes advantage of the increasing 
correlation length: as the temporal correlation increases, this 
method has to perform fewer "divide and reiterate" steps, so the 
number of operations required gets smaller and, consequently, 
also the energy spent for compression. For the remaining 
methods the complexity grows with n* . For PLAMLiS, this 
is due to the first step of the algorithm, where for each point 
the longest segment that respects the given error tolerance has 
to be found, see Section [2] When x(n) is highly correlated, 
these segments become longer and PLAMLiS has to check 
a large number of times the tolerance constraint for each of 
the N samples of x(n). For M-AAR and PR every time a new 
sample is added to a model (autoregressive for the former and 
polynomial for the latter), this model must be updated and the 
error tolerance constraint has to be checked. These tasks have 
a complexity that grows with the square of the length of the 
current model. Increasing the correlation length of the input 
time series also increases the length of the models, leading 
to smaller compression ratios and, in turn, a higher energy 
consumption. 

Fourier-based Methods: we now analyze the performance of 
the Fourier-based compression schemes of Section [2] We con- 
sider the same simulation setup as above. Fig. |4(a)| shows that 
also with Fourier-based methods the compression performance 
improves with increasing «*. The methods that perform best 
are FFT Windowed, FFT-LPF Windowed and DCT-LPF, which 
achieve very small compression ratios, e.g., rj is around 10" 2 
for n* > 300. Conversely, FFT and FFT-LPF, due to their edge 
discontinuity problem (see Section|2]l, need to encode more co- 
efficients to meet the prescribed error tolerance constraint and 
thus their compression ratio is higher, i.e., around 10 _1 . The en- 
ergy cost for compression is reported in Fig. |4(b)j where ri* is 
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Figure 4: (a) rj vs Correlation Length n* and (b) vs Energy consumption for compression for the Fourier-based methods for fixed e = 4cr no j se . 
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Figure 5: Compression Ratio vs Total Energy Consumption for the two single-hop scenarios considered: (a) WSN and (b) UWN. 



varied as an independent parameter. The compression cost for 
these schemes is given by a first contribution, which represents 
the energy needed to evaluate the FFT/DCT of the input signal 
x(n). Thus, there is a second contribution which depends on the 
number of transformation coefficients that are picked. Specif- 
ically, a decreasing n* means that the signal is less correlated 
and, in this case, more coefficients are to be considered to meet 
a given error tolerance. Further, for each of them, an inverse 
transform has to be evaluated to check whether an additional 
coefficient is required. This leads to a decreasing computational 
cost for increasing n* . 



As a last observation, we note that FFT-based methods 
achieve the best performance in terms of compression ratio 
among all schemes of Figs. |3(b)| and |4(b)| (DCT-LPF is the 
best performing algorithm), whereas PLA schemes give the best 
performance in terms of energy consumption for compression 
(LTC is the best among them). 



3.6. Application Scenarios 

As discussed above, we evaluated the selected compression 
methods considering the energy consumed for transmission 
of typical radios in Wireless Sensor Networks (WSN) and an 
Underwater Networks (UWN). In the following, we discuss the 
performance for these application scenarios in single- as well 
as multi-hop networks. 

Single-hop Performance: Fig. [5] shows the performance in 
terms of Compression Ratio 77 vs Total Energy Consumption 
for a set of compression methods when applied in the two se- 
lected application scenarios. PLAMLiS was not considered as 
its performance is always dominated by E-PLAMLiS and we 
only show the performance of the best Fourier-based schemes. 
In both graphs the large white dot represent the case where no 
compression is applied to the signal, which is entirely sent to 
the gathering node. Note that energy savings can only be ob- 
tained for those cases where the total energy lies to the left of 
the no compression case. 



9 




Figure 6: Energy Gain vs Correlation Length r\* for the two single-hop scenarios considered: (a) WSN and (b) UWN. 



In the WSN scenario, the computational energy is compara- 
ble to the energy spent for transmission, thus, only LTC and 
Enhanced PLAMLiS can achieve some energy savings (see 
Fig |5(a)) >. All the other compression methods entail a large 
number of operations and, in turn, perform worse than the no 
compression case in terms of overall energy expenditure. For 
the UWN scenario, the energy spent for compression is always 
a negligible fraction of the energy spent for transmission. For 
this reason, every considered method provides some energy 
savings, which for PR and Fourier methods can be substantial. 

The total energy gain, defined as the ratio between the energy 
spent for transmission in the case with no compression and the 
total energy spent for compression and transmission using the 
selected compression techniques, is shown in Fig. [6] 

In the WSN scenario, i.e., Fig |6(a)| the method that offers 
the highest energy gain is LTC, although other methods such 
as DCT-LPF can achieve better compression performance 
(see Fig |5(a)| i. Note that in this scenario the total energy is 
highly influenced by the computational cost. Thus, the most 
lightweight methods, such as LTC and enhanced PLAMLiS, 
perform best. In the UWN case, whose results are shown in 
Fig |6(b)| the computational cost is instead negligible with 
respect to the energy spent for transmission. As a consequence, 
the energy gain is mainly driven by the achievable compression 
ratio and the highest energy gain is obtained with DCT-LPF. 
In this scenario, PR, which is computationally demanding, can 
lead to large energy savings too, whereas the energy gain that 
can be obtained with more lightweight schemes, such as LTC, 
is quite limited. 

Multi-hop Performance: in Fig. [7] we focus on multi-hop net- 
works, and evaluate whether further gains are possible when 
the compressed information has to travel multiple hops to reach 
the data gathering point. Both WSN and UWN scenarios are 
considered. In this case, both transmitting and receiving en- 
ergy is accounted for at each intermediate relay node. Only 
LTC and DCT-LPF are shown, as these are the two methods 
that respectively perform best in the WSN and UWN scenarios. 



Their performance is computed by varying the error toleranc e 
e e \3o-„ oise ,4o- nojse ,5cr no i se }, whereas the correlation length is 
fixed to n* = 300. 

For the WSN scenario the energy gain increases with the 
number of hops for both compression schemes. As we have 
already discussed, in this case the energy spent for the com- 
pression at the source node is comparable to the energy spent 
for the transmission. The compression cost (compression en- 
ergy) is only incurred at the source node, whereas each addi- 
tional relay node only needs to send the compressed data. This 
leads to an energy gain that is increasing with the number of 
hops involved. We also note that DCT-LPF is not energy effi- 
cient in single-hop scenarios, but it can actually provide some 
energy gains when the number of hops is large enough (e.g., 
larger than 2 for s e {4cr no!je , 5cr„ oiSe }, see Fig. |7(a)| >. 

Conversely, in the UWN scenario the energy spent for com- 
pression is a negligible fraction of the energy spent for transmis- 
sion. Henceforth, the overall energy gain over multiple hops is 
nearly constant and equal to the energy savings achieved over 
the first hop. 

3. 7. Numerical Fittings 

In the following, we provide close-formulas to accurately re- 
late the achievable compression ratio rj to the relative error tol- 
erance £ and the computational complexity, N c , which is ex- 
pressed in terms of number of clock cycles per bit to compress 
the input signal x(n). These fittings have been computed for the 
best compression methods, namely, LTC and DCT-LPF. 

Note that until now we have been thinking of 77 as a perfor- 
mance measure which depends on the chosen error tolerance 
s = ^cr tW i se . This amounts to considering £ as an input pa- 
rameter for the compression algorithm. In the following, we 
approximate the mathematical relationship between r\ and £ by 
conversely thinking of £ as a function of r\, which is now our 
input parameter. N c can as well be expressed as a function of 77. 

We found these relationships through numerical fitting, run- 
ning extensive simulations with synthetic signals. The rela- 
tive error tolerance £ can be related to the compression ratio 
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Figure 8: Fitting functions f(n* ,rf) vs experimental results: (a) LTC, (b) DTC-LPF. 



77 through the following formulas: 



PiT + Pi 1 ! + Pi 



i] + qi 

pin 4 + pvf + pin 2 + p 4 Tj + p 5 



i] + qi 



LTC 



DCT-LPF , 



(2) 



where the fitting parameters pi,p2,pi, Pa, Ps, and q\ depend on 
the correlation length n* and are given in Table [2] for LTC and 
DCT-LPF. These fitting formulas have been validated against 
real world signals measured from the environmental monitor- 
ing WSN testbed deployed on the ground floor of the Depart- 
ment of Information Engineering (DEI), University of Padova, 
Italy [32]. This dataset consists of measures of temperature 
and humidity, sensed with a sampling interval of 1 minute for 
6 days. Correlation lengths are n£ = 563 and = 355 for 
temperature and humidity signals, respectively. The empirical 
relationships of Eq. (fJJ are shown in Fig. |8(a)| and |8(b)| through 
solid and dashed lines, whereas the markers indicate the per- 
formance obtained applying LTC and DCT-LPF to the consid- 



ered real datasets. As can be noted from these plots, although 
the numerical fitting was obtained for synthetic signals, Eq. (fJJ 
closely represents the actual tradeoffs. Also, with decreasing 
n* the curves relating £ to 77 remain nearly unchanged in terms 
of functional shape but are shifted toward the right. Finally, 
we note that the dependence on n* is particularly pronounced 
at small values of «*, whereas the curves tend to converge for 
increasing correlation length (larger than 1 10 in the figure). 

For the computational complexity, we found that N c scales 
linearly with 77 for both LTC and DCT-LPF. Hence, A^ c can be 
expressed through a polynomial as follows: 

N c (n*, rj)-ari + yn* + /3 . 

A^e exhibits a linear dependence on both n* and 77; the fitting 
coefficients are shown in Table [3] Note that the dependence on 
n* is much weaker than that on 77 and for practical purposes can 
be neglected without loss of accuracy. In fact, for DCT-LPF 
there is a one-to-one mapping between any target compression 
ratio and the number of DCT coefficients that are to be sent to 
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-0.92752 


-3.07506 


3.07560 
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0.02898 
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DCT-LPF 


50 


-1.90344 


-0.17491 


-0.13500 


2.43821 


-0.03826 


-0.03929 


80 


-2.59629 


1.41404 


-1.40970 
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-0.04122 


-0.02667 
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-2.57150 
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-1.51646 


2.87138 


-0.02747 


-0.01913 
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Table 2: Fitting coefficients for f (n* , rj). 
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DCT-LPF 


48.1 • 10 3 


82.3 
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Table 3: Fitting coefficients for N c (n* , rj). 

achieve this target performance (the computational complexity 
is directly related to this number of coefficients). Note that, 
differently from Fig. |4] this reasoning entails the compression 
of our data without fixing beforehand the error tolerance s. For 
LTC, the dominating term in the total number of operations per- 
formed is rj, as this term is directly related to the number of seg- 
ments that are to be processed. For this reason, in the remainder 
of this section we consider the simplified relationship: 



N c (t]) = ar]+/3. 



(3) 



The accuracy of Eq. © is verified in Fig. [9] where we plot our 
empirical approximations against the results obtained for the 
real world signals described above. The overall energy con- 
sumption is obtained as Nb(x)N c (r])Eo. 

Tradeoffs: in the following, we use the above empirical formu- 
las to generalize our results to any processing and transmission 
technology, by separating out technology dependent and algo- 
rithmic dependent terms. Specifically, a compression method is 
energy efficient when the overall cost for compression (E c (x)) 
and transmission of the compressed data (Et x (x)) is strictly 
smaller than the cost that would be incurred in transmitting 
x(n) uncompressed (E Tx (x)). Mathematically, E c (x) + E Tx (x) < 
Etx(x). Dividing both sides of this inequality by Et x (x) and 
rearranging the terms leads to: 



E Tx (x) E' Tx N b (x) 



1 



E c (x) E N c N b (x) 1 - 77 

where the energy for transmission Ej x (x) is expressed as the 
product of the energy expenditure for the transmission of a bit 
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Figure 9: Fitting functions N c (rj) vs experimental results. 



E' and the number of bits of x(n), N b {x). The energy for com- 
pression is decomposed in the product of three terms: 1) the 
energy spent by the micro-controller in a clock cycle Eq, 2) the 
number of clock cycles performed by the compression algo- 
rithm per (uncompressed) bit of x(n), N c and 3) the number of 
bits composing the input signal x(n), Nb(x). With these energy 
costs and the above fitting Eq. (f3) for N c we can rewrite the 
above inequality so that the quantities that depend on the se- 
lected hardware architecture appear on the left hand side, leav- 
ing those that depend on algorithmic aspects on the right hand 
side. The result is: 



£z 

E 



Tx > NM 



i 



arj +j3 
1 



(4) 



where a and /3 are the algorithmic dependent fitting parame- 
ters indicated in Table [3] Eq. can be used to assess whether 
a compression scheme is suitable for a specific device archi- 
tecture. A usage example is shown in Fig. [10] In this graph, 
the curves with markers are obtained plotting the right hand 
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Figure 10: Energy savings assessment vs r\ and hardware architecture. 



mance. These have been validated against real datasets and can 
be used to assess the effectiveness of the selected compression 
schemes for further hardware architectures. 
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side of Eq. (|4]i for r\ < 1, whereas the lines refer to the ex- 
pression on the left hand side of Eq. © for our reference sce- 
narios, i.e., WSN (solid line) and UWN (dashed line). In the 
WSN scenario, E' Tx = 230 nJ for the selected CC2420 ra- 
dio, whereas for the TI MSP430 we have E = 0.726 nJ and 
their ratio is E' T /Eq =* 316. The graph indicates that, in this 
case, DCT-LPF is inefficient for any value of 77, whereas LTC 
provides energy savings for 77 < 0.6, that using the function 
£(n* , if) for LTC can be translated into the corresponding (ex- 
pected) error performance. Note that the knowledge of n* is 
needed for this last evaluation. Conversely, for the UWN sce- 
nario, where E' Tx = 10 mW and E = 0.726 nJ, both DCT- 
LPF and LTC curves lie below the hardware dependent ratio 
E' T /Eq - 13 • 10 6 , indicating that energy savings are achievable 
by both schemes for almost all r\. These results can be gener- 
alized to any other device technology, by comparing the right 
hand side of Eq. <j4j against the corresponding ratio E' T JEq and 
checking whether Eq. © holds. 

4. Conclusions 

In this paper we have systematically compared lossy com- 
pression algorithms for constrained sensor networking, by in- 
vestigating whether energy savings are possible depending on 
signal statistics, compression performance and hardware char- 
acteristics. Our results revealed that, for wireless transmission 
scenarios, the energy required by compression algorithms is of 
the same order of magnitude of that spent in the transmission at 
the physical layer. In this case, the only class of algorithms that 
provides some energy savings is that based on piecewise lin- 
ear approximations, as these algorithms have the smaller com- 
putational cost. In addition, we have also considered under- 
water channels which, due to the nature of the transmission 
medium, require more energy demanding acoustic modems. In 
this scenario, techniques based on Fourier transforms are the 
algorithms of choice, as these provide the highest compression 
performance. Finally, we have obtained fitting formulas for the 
best compression methods to relate their computational com- 
plexity, approximation accuracy and compression ratio perfor- 
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